Genetic Algorithm Wrappers For Feature Subset Selection In Supervised Inductive Learning
نویسندگان
چکیده
1. Inferential loss: Quality of the model produced by an inducer as detected through inferential loss evaluated over a holdout validation data set Dval ≡ D \ Dtrain 2. Model loss: “Size” of the model under a specified coding or representation 3. Ordering loss: Inference/classificationindependent and model-independent measure of data quality given only training and validation data D and hyperparameters ÿ
منابع مشابه
Control of Inductive Bias in Supervised Learning using Evolutionary Computation: A Wrapper-Based Approach
In this chapter, I discuss the problem of feature subset selection for supervised inductive learning approaches to knowledge discovery in databases (KDD), and examine this and related problems in the context of controlling inductive bias. I survey several combinatorial search and optimization approaches to this problem, focusing on datadriven validation-based techniques. In particular, I presen...
متن کاملGenetic Algorithms for Reformulation of Large-Scale KDD Problems with Many Irrelevant Attributes
The goal of this research is to apply genetic implementations of algorithms for selection, partitioning, and synthesis of attributes in largescale data mining problems. Domain knowledge about these operators has been shown to reduce the number of fitness evaluations for candidate attributes. We report results on genetic optimization of attribute selection problems and current work on attribute ...
متن کاملGenetic Wrappers for Constructive Induction in High-Performance Data Mining
We present an application of genetic algorithmbased design to configuration of high-level optimization systems, or wrappers, for relevance determination and constructive induction. Our system combines genetic wrappers with elicited knowledge on attribute relevance and synthesis. We discuss decision support issues in a largescale commercial data mining project (cost prediction for multiple autom...
متن کاملA Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems
Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...
متن کاملA Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems
Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...
متن کامل